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Abstract 

Let ti,... ,tni S and pi,... ,Pn^ S and consider the bipartite location recovery prob¬ 
lem: given a subset of pairwise direction observations {{U —pj)/\\ti — Pj\\2}i,jelne]xlns]j where 
a constant fraction of these observations are arbitrarily corrupted, find and {Pj}j^[n^] 

up to a global translation and scale. We study the recently introduced ShapeFit algorithm as a 
method for solving this bipartite location recovery problem. In this case, ShapeFit consists of 
a simple convex program over d{ni + Us) real variables. We prove that this program recovers 
a set of ni + Ug i.i.d. Gaussian locations exactly and with high probability if the observations 
are given by a bipartite Erdos-Renyi graph, d is large enough, and provided that at most a 
constant fraction of observations involving any particular location are adversarially corrupted. 
This recovery theorem is based on a set of deterministic conditions that we prove are sufficient 
for exact recovery. Finally, we propose a modified pipeline for the Structure for Motion problem, 
based on this bipartite location recovery problem. 


1 Introduction 

Structure from Motion (SfM) is the task of recovering 3d structure from a collection of images taken 
from different vantage points [B]. In the SfM problem, camera poses are represented by locations 

E M^, i = 1 ... rii and rotation matrices Ri E SO{‘i),i = 1 ... nr, where Ri maps coordinates in 
the frame of the zth camera to the world frame. For a generic structure point p E there exists 
a unique point in each imaging plane given by perspective projection. A pair of image points is 
said to correspond when they are both projections of the same point in 3d space. Given enough 
point correspondences between a pair of views, epipolar geometry yields the relative rotation and 
direction between those views. Pairwise relative camera poses can then be used to estimate the 
individual poses Ri),i = 1... up to a Euclidean transformation. Knowledge of camera poses 
and point correspondences allows one to estimate 3d structure via triangulation. Finally, the pose 
and structure estimates are used as initialization for bundle adjustment, which is the simultaneous 
nonlinear refinement of structure and camera poses. In summary, SfM typically consists of four 
steps: 1) identify point correspondences; 2) recover camera orientations and locations in global 
coordinates; 3) triangulate structure points using estimates of camera pose and correspondences; 
and 4) perform bundle adjustment. 

A central difficulty of SfM is that point correspondences are prone to errors because they are 
found purely by local photometric information, which is subject to projective transformations from 
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camera motion, specularities, occlusions, variable lighting conditions, shadows, and repetitive struc¬ 
tures commonly found in manmade scenes. Thus, every step of the above SfM pipeline needs to 
tolerate highly corrupted input data. For the correspondence step, techniques such as Random 
Sampling Consensus (RANSAC) are used to reduce the number of outliers among candidate cor¬ 
respondences initially obtained by brute-force photometric matching. Unfortunately, even after 
applying RANSAC, outliers in point correspondences are generally unavoidable. 

Mathematically, once a set of correspondences has been established, the SfM problem can be 
formulated as the d = 3 case of the following. Let be a collection of distinct vectors 
..., tn^ G and let be a collection of Us distinct vectors ... ,pn] G Associated 
to locations is a set of orientations R = G SO{d). The pairs {tf'\Ri) represents 

poses from which observations of the points p® are collected. Let G{n^,ns,E) be a bipartite 
graph on vertices, where E = EgU with Ef, and Eg corresponding to pairwise direction 

observations that are respectively ‘corrupted’ and ‘uncorrupted.’ That is, for each ij G E, we are 
given a vector Vij, where 




for ij G Eg, 


Vij G ^ for ij G Eb. 


An uncorrupted observation Vij is exactly the direction of R\ (t ® a corrupted observation 

is an arbitrary direction. Consider the task of hnding the unknown locations and structure 
points up to a global translation and scale, and the orientations R, up to a global rotation, 
without knowledge of the decomposition E = Eg \J Eb, nor the nature of the corruptions. 

Estimating camera orientations Ri from from corrupted relative rotations RjRj is a tractable 
and relatively well-understood problem. For instance, a method based on Lie group averaging 
performs well in practice [3], and a semidehnite program based on lifting and least unsquared 
deviations (LUD) has rigorous guarantees of exact recovery from corrupted relative rotations [11] . 
Once camera orientations are estimated, one can use epipolar geometry to obtain a set of relative 
direction estimates of camera locations. These estimates are partially corrupted since they are 
computed from the initial point correspondences. Camera locations in a global reference frame can 
be estimated using the IdSfM approach of [12], which screens for outliers based on inconsistencies 
in Id projections; however, this approach is not robust to self-consistent outliers. Alternatively, 
locations can be found by recent methods such as LUD |7] or the ShapeFit algorithm Hj, which are 
both convex programs. It was proven in |4] that ShapeFit recovers locations exactly from partially 
corrupted pairwise directions under broad technical assumptions. 

Having obtained an estimate of camera orientations and locations, one can recover an estimate 
of the 3d structure by triangulation, for instance by minimizing the quadratic reprojection error 
or maximizing a likelihood estimate. Bundle adjustment then proceeds by jointly optimizing this 
reprojection error or likelihood estimate with respect to camera poses and 3d structure. It is 
important to initialize bundle adjustment close to the global minimum, because it is non-convex 
and susceptible to getting stuck in local minima. 

In this paper, we consider compressing two sub-steps of the pipeline — camera location recovery 
and structure recovery by triangulation — into one provably corruption-robust step based on the 
ShapeFit algorithm. Namely, once camera rotations are estimated, our approach uses the raw 
image coordinates of point correspondences to recover the camera locations and structure points 
simultaneously. If a structure point pj is visible to a calibrated camera at location ti, then its 
image coordinates under perspective projection provide a vector Vij that has the same direction as 
—Pj)- If the orientation Ri is known and accurate, then the direction of ti —pj is also known. 
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Equivalently, if all the orientations Ri are known, we can take each Ri to be the identity without 
loss of generality. When a point correspondence is incorrect, the estimated direction of ti — pj can 
of course be arbitrarily corrupted. We thus arrive at the following recovery problem. 

With and defined as above, for each ij G E, we are given a vector Vij, where 


^ inh)—ikir ^ ^ ^ ^ ^b- 

11^1 -Pj II 2 


( 1 ) 


Thus, an uncorrupted observation Vij is exactly the direction of ^ corrupted obser¬ 
vation is an arbitrary direction. The task is to find the unknown locations up to global 

translation and scale, without knowledge of the decomposition E = Eg \J Eb, nor the nature of the 
corruptions. 

To summarize, we propose the following modified pipeline for Structure from Motion: 1) estab¬ 
lish point correspondences; 2) estimate global orientations of the cameras; 3) estimate the camera 
locations and structure points simultaneously; and 4) run bundle adjustment. 

We will show that ShapeFit, a tractable convex program, can exactly solve the recovery problem 
in Step 3 under broad deterministic assumptions and under a random model. In [3], the present 
authors showed that ShapeFit recovers camera locations exactly from corrupted pairwise direction 
under suitable assumptions. The result in [3] strongly relies on the existence of triangles in the 
graph of observations, whereas in our present setting, the underlying graphs are bipartite and 
necessarily do not contain triangles. In this bipartite setting, we will prove a deterministic recovery 
result for ShapeFit based on the presence of cycles of length 4. We also show that under a random 
Gaussian and Erdos-Renyi model, ShapeFit recovers structure and locations exactly from known 
orientations and corrupted correspondences with high probability in the high dimensional case. To 
the best of our knowledge, these are the first theoretical results guaranteeing exact location and 
structure recovery from corrupted correspondences and known orientations. 


1.1 Problem formulation 

The location recovery problem is to recover a set of points in from observations of pairwise 
directions between those points. Since relative direction observations are invariant under a global 
translation and scaling, one can at best hope to recover the locations and 

structure points P^^'^ = ... ,Pn^} up to such a transformation. That is, successful recovery 

from {%'}(ij)gE is finding two sets of vectors + '^)}je[ns] for some w 

and a > 0. We will say that two pairs of sets of vectors {T,P) and pi^)) are equal up to 

global translation and scale if there exists a vector w and a scalar a > 0 such that ti = -|- w) 

for all i G [rig] and pj = -|- w) for all j G [ng]. In this case, we will say that {T,P) and 

( 21 ( 0 )^p(o)) j^g^yg ^}^g game ‘shape,’ and we will denote this property as {T,P) ~ The 

location recovery problem is then stated as: 

Given: G{ni,ns,E), {vijjij^E satisfying ([T]) 

Find: r = {fi,...,t„J GM'^^^^P = {pl,...,p„J such that {T, P) ^ {T^^\ P^^^) 

For this problem to be information theoretically well-posed under arbitrary corruptions, the 
maximum number of corrupted observations affecting any particular location ti must be at most 
Similarly, the maximum number affecting any particular structure point pj must be at most 
Otherwise, suppose that for some location of structure point p^J‘\ half of its associated 
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observations Vij are consistent with and the other half are corrupted so as to be consistent 
with some arbitrary alternative location w. Distinguishing between and w is then impossible 
in general. A similar argument follows for some structure point ■ Formally, let degfj{ti) be the 
degree of location ti in the graph G{ni,ns, Ef,) and let deg^(pj) be the degree of structure point 
Pj in the graph G{n£,ns, Ef,). Then, well-posedness under adversarial corruption requires that 
maxig[„^] degb(ti) < and rnax^gj^^] degh{vj) < yus, for some y < 1 / 2 , 

Beyond the above necessary degree condition on Eg for well-posedness of recovery, we do not 
assume anything about the nature of corruptions. That is, we work with adversarially chosen 
corrupted edges E^ and arbitrary corruptions of observations associated to those edges. To solve 
the location recovery problem in this challenging setting, we utilize the convex program called 
ShapeFit [1]: 


nt ris 

min ^ \\P^± {ti - Pj )\\2 subject to ^ {U - Pj,Vij) = 1, '^k + '^pj = 0 (2) 

iPjijelns] 

where P.a is the projector onto the orthogonal complement of the span of va- 

This convex program is a second order cone problem with d{ni + Ug) variables and two con¬ 
straints. Hence, the search space has dimension d{ni+ns) — 2, which is minimal due to the d{n£-\-ns) 
degrees of freedom in the locations {ti} and structure points {pj} and the two inherent degeneracies 
of translation and scale. 

1.2 Main result 

In this paper, we consider the model where the ni locations and Ug structure points are i.i.d. 
Gaussian, and where pairwise direction observations are given according to an Erdos-Renyi bipartite 
random graph. We show that in a high-dimensional setting, ShapeFit exactly recovers the locations 
and structure points with high probability, provided that and ng are sub-exponential in d, and 
provided that at most a fixed fraction of observations are adversarially corrupted. 

Theorem 1. Let N = max(nf, n^), n = min(n£,ns). Let G{V£ U Vs,E) be drawn from a bipartite- 
Erdos-Renyi graph with p > 0. Take t^^\ ... tn},P^i \ ■ ■ ■ ,Pn} ~ A^(0, Idxd) to he independent from 
each other and G. Then, there exist absolute constants c, C 3 , G > 0 such that for y = csp*^, if 

max (Gd, ^(cg log^ <n<N < 

\C3P^ P J 

and d = H(l), then there exists an event with probability at least 1 — 0{e~ on 

which the following holds: 

For all subgraphs Eh satisfying maxjgj^^j degft(ti) < yn* and max^gj^^j deg^(pj) < 'jni and all pair¬ 
wise direction corruptions Vij G for ij G E^, the convex program (l2|) has a unique minimizer 
equal to {a{tf ^ - C}ie[n,], a{pf^ - C}je[nJ } for some positive a and for ( = (Eie[n,] tf^ + EjeM pf 

This probabilistic recovery theorem is based on a set of deterministic conditions that we prove 
are sufficient to guarantee exact recovery. These conditions are satisfied with high probability in 
the model described above. See Section [Q for the deterministic conditions. 

This recovery theorem is high-dimensional in the sense that the probability estimate and the 
exponential upper bound on -|- Ug are only meaningful for d = H(l). Concentration of measure 
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in high dimensions and the upper bound on ni + Ug ensure control over the angles and distances 
between random points. As a result, lower dimensional spaces are a more challenging regime for 
recovery. 

Numerical simulations empirically verify the main message of these recovery theorem: ShapeFit 
simultaneously recovers a set of locations and structure points exactly from corrupted direction 
observations, provided that up to a constant fraction of the observations at each location and 
structure point are corrupted. We present numerical studies in the physically relevant setting of 
with an underlying random Erdos-Renyi bipartite graph of observations. Further numerical 
simulations show that recovery is stable to the additional presence of noise on the uncorrupted 
measurements. That is, locations and structure points are simultaneously recovered approximately 
under such conditions, with a favorable dependence of the estimation error on the measurement 
noise. 

1.3 Organization of the paper 

Section 11.41 presents the notation used throughout the rest of the paper. Section [2] presents the 
proof of Theorem [TJ Section [3] presents results from numerical simulations. 

1.4 Notation 

Let [k] = {1,..., k}. Let = [n(\ and 14 = [?^s]- Let N = max(n£, rig) and n = min(n£, rig). . Let 
Cj be the rth standard basis element. For a bipartite graph G(14 U 14, F), we write an arbitrary 
edge as an ordered pair (4 j), where i G 14 and j G 14- Let be the complete bipartite graph 

on ni + rig vertices. A cycle of length 4 will be denoted as C 4 . Let E[Kn^,ns) be the set of edges in 
Kni,ns- Let II • II 2 be the standard £2 norm on a vector. For any nonzero vector v, let v = u/||u|| 2 . 
For a subspace W, let Pw be the orthogonal projector onto W. For a vector v, let be the 
orthogonal projector onto the orthogonal complement of the span of {u}. 

Let T denote the set T = for ti G Let P denote the set P = {pj}j^Vs: for pj G 

For i & Vi, j & 14 , define tij = U — pj for all i ^ Vi, j G 14- For i,k G Vi, define tik = U — tk- 
For j,l G Fs, define tji = pj - pi. Define C = + 'Ejev^Pj)- Llehne t[f, T®, C^o), 

similarly. We define poo = maxj^j ||t®|| 2 - For a scalar c and a set of vectors X C let 
cX = {cx : X G X}. For a given G = G(14 U 14, P) and where Vij G have unit norm, 

let R{T,P) = EijdE \\Pvj-Mjh- Let L(T,P) = ■ Let £ij = {tij,Vij), and similarly for 

In this notation, ShapeFit is 

mmi?(r, P) subject to L(T, P) = l, C = 0 

For vectors vi,... ,Vk, let S{vi, ..., Vk) = span(ui,..., v^) be the vector space spanned by these 
vectors. Given tij and define 5ij, pij, and Sij such that 

tij — (It ^ij)£ij T ^ijSij 

where Sij is a unit vector orthogonal to and ijij = ||P(o)xLj|| 2 - Note that rjij > 0. 

2 Proofs 

We will prove Theorem[T]using the same general strategy as in [1|. Specifically, the proof of Theorem 
[H can be separated into two parts: a recovery guarantee under a set of deterministic conditions. 
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and a proof that the random model meets these conditions with high probability. These sufficient 
deterministic conditions, roughly speaking, are (1) that the graph is connected and the nodes have 
tightly controlled degrees; (2) that the camera and structure locations are all distinct; (3) that all 
pairwise distances between cameras and locations are within a constant factor of each other; (4) 
that any choice of two camera locations and two structure locations live in a three dimensional 
affine space; (5) that the camera and structure locations are ‘well-distributed’ in a sense that we 
will make precise; and (6) that there are not too many corruptions affecting a single camera location 
or structure point. Theorem [2] in Section [2.11 states these deterministic conditions formally. 

As in [4], we will prove the deterministic recovery theorem directly, using several geometric 
properties concerning how deformations of a set of points induce rotations. Note that an infinitesi¬ 
mal rigid rotation of two points {ti, tj} about their midpoint to {ti + hi,tj + hj} is such that hi — hj 
is orthogonal to tij = U — tj. We will abuse terminology and say that \\P^±{hi — /ij)|| is a measure 

'^3 

of the rotation in a finite deformation and we say that {hi — hj,ti — tj) is the amount of 

stretching in that deformation. Using this terminology, the geometric properties we establish are: 

• If a deformation stretches two adjacent sides of a C 4 at different rates, then that induces a 
rotation in some edge of the C 4 (Lemma [2]). 

• If a deformation rotates one edge shared by many C 4 S, then it induces a rotation over many 
of those C4S, provided the opposite points of those triangles are ‘well-distributed’ (Lemma 

El). 

• A deformation that rotates bad edges, must also rotate good edges (Lemma |1|. 

• For any deformation, some fraction of the sum of all rotations must affect the good edges 

(Lemma El) • 

By using these geometric properties, we show that all nonzero feasible deformations induce a large 
amount of total rotation. Since some fraction of the total rotation must be on the good edges, the 
objective must increase. 

The main technical difference between the present proof and the proof of [1] is that the proof in 
[1] relies on the presence of many triangles in the graph of uncorrupted measurements. Because of 
the bipartite structure of the present work, there are no triangles in the graph. Hence, the technical 
novelty of the present proof is the establishment of the properties above when there are a sufficient 
number of C 4 S in the graph of uncorrupted measurements. 

In Section 12.11 we present the deterministic recovery theorem. In Section 12.21 we present and 
prove Lemma [2l In Section 12.3( we present and prove Lemmas EHSl In Section 12.41 we prove the 
deterministic recovery theorem. In Section [2.51 we prove that Gaussians satisfy several properties 
with high probability. In Section [2]6l we prove that Gaussians satisfy well-distributedness with high 
probability. In Section 12.71 we prove that Erdos-Renyi graphs are connected and have controlled 
degrees and codegrees with high probability. Finally, in Section [2.81 we prove Theorem [H 

2.1 Deterministic recovery theorem in high dimensions 

To state the deterministic recovery theorem, we need two definitions. The first definition captures 
the ‘regularity’ of the measurement graph. A random bipartite graph can easily be seen to satisfy 
the conditions. Note that the definition does not depend on the vectors locations {tj} and {pj}- 

Definition 1. We say that a graph G(I4 U Vs,E) is bipartite-p-typical if it satisfies the following 
properties: 
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1. G is connected, 

2 . eaeh vertex in has degree between ^UsP and 2nsP, and 

each vertex in Vg has degree between ^n£p and 2n£p. 

3. each pair of vertices in V£ has codegree between \nsP^ and 2nsP^, where the codegree of j,l G 

V£ = \{i\ ij G E{G),il G £^(G)}|. Each pair of vertices in Vg has codegree between \n£p^ and 

2 n£p^. 

The next definition captures how ‘well-distributed’ the location points {ti} and {pj} are in W^. 

Definition 2. 

(i) Let S = {{tk,Pk)}k=i...m C X Let G R'^. We say that S is c-well-distributed with 
respect to {x, y) if the following holds for all h G R'’*; 

'y ^ \\^spa,n{p—x,t—p,y—t}-^{^')\\‘2 — ' IIy)-'-(^) II2 • 

(t,p)eS 

(a) Let T = {ti}i^Vi ^ ~ {Pj}j&Vs- say that {T,P) is c-well-distributed along G if for 
all i £V£,j £ Vg, the set Sij = {{tk,P£) '■ iH G E{G),ki G E{G),kj G E{G),k 7 ^ 7 ^ j} is 

c-well-distributed with respect to {ti,pj). 

We now state sufficient deterministic recovery conditions on the graph G, the subgraph Ei, 
corresponding to corrupted observations, and the locations and 

Theorem 2. Suppose P^^l, Ei,,G satisfy the conditions 


1 . 

2 . 

3. 

I 

5. 

6 . 


The underlying graph G is bipartite-p-typieal, 

All vectors in T(o), p(o) andT(o) are distinct, respectively. 

For all i,k £V£ and j,i G Vg, we have co||t®||2 < ll^j^lb? 

For all i,k G V£,j,i G Vg such that k ^ i,j ^ i, we have 


mm 


P 


do)i 


span(tg\d°))^ ij 


\P 


span(4°AtSV b’ 


( 0 ), 


( 0 )| 

ij 


>/3 


The pair is ci-well-distributed along G, 

Each vertex in V£ (resp. Vg) has at most eng (resp. en£) incident edges in Ei,. 


for constants 0 < p,co, /3,ci,e < 1. Ife < o.nd n£,ng > max(64, ^), then L{T^^\ P^^'i) 7 ^ 0 

and {T^^\P^^i)/L{T^^\P^^'>) is the unique optimizer of ShapePit. 


Before we prove the theorem, we establish that L{T^^\ P^^i) 7 ^ 0 when e is small enough. This 
property guarantees that some scaling of (t(^\ p(V'j jg feasible and occurs, roughly speaking, when 
\Eb\ < \Eg\. 

Lemma 1. If e < then L(T( 0 ),P( 0 )) ^ 0 . 

Proof. Since Vij = for all ij £ Eg, we have 


L(r'»),p<"))= ^ Ii4“>ib- E Ii4“’ii- 

ij&E{G) ij&Eg ij&Et, 

By Condition 3, coPoo\Eg\ < Ylij&Eg ll^^^lb and pioo\Eh\ > J2ij(^Ei, Pij^lb- Thus it suffices to prove 
that colL'gl > |£^b|. As e < |, Condition 1 and 6 gives \Eg\ > ^niUsP — en£ns > \n£nsp. Since 
\Eb\ < eniUs, if e < then we have co\Eg\ > \Eh\. □ 
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2.2 Unbalanced parallel motions induce rotation 

The following lemma concerns geometric properties of deformations of a set of points. Specifically 
it shows that if four points are deformed in a way that differentially scales the lengths of two edges, 
then it necessarily induces a rotation somewhere in a C 4 containing those points. 

Lemma 2. Let d > 3. Let ti,t2T3)^4 £ be distinct. Let tij = ti — tj and iij = Let 

G 1^“* and a G M. Let {5j(j+i)} be such that {vi - Ui+i - a%+i), ti(i+i)) = ||ti(i+i) II 2 

for each i £ [4], where index summation is modulo 4- 


\\Ptf.A'^i-Vi+i)h > 

T’span(i23 ,i4l 

2 

dl2 — ^34 

ie[4] 





T’span(t34 ,*41 )X /12 

2 

dl2 — ^23 

ie[4] 





Proof. The given condition implies P^± (uj — Uj+i) = u* — Uj+i — (a + )tj(j+i) for each i G [4]. 
Therefore, 


\\Ptf.Avi-Vi+l)\\2 

is [4] 


> 


^ ^ ^i +1 T ^i(i+l) 

is [4] 


ie[4] 2 

= ||5l2tl2 + ^23^23 + ^34^34 + ^4lt4l||2- (3) 

(i) Since 534(^12 + ^23 + ^34 + ki) = 0, the right-hand-side of ^ equals ||(5i2 - ^ 34)^12 + {^3 - 
<^ 34)^23 + (<541 — ^ 34)^41 lb- The conclusion follows since 


( 5 i 2 — ^ 34)^12 + {^23 — ^ 34)^23 + (^41 — 534 )f 4 l 


> min ||( 5 i 2 - 534 )ti 2 - st23 - 'ST 41 II 2 

2 s.s'GM. 


'^span(t23,i4l)“*“ (^12 ^ 34)^12 


(ii) Since ^ 23(^12 + ^23 + ^34 + ^ 41 ) = 0, the right-hand-side of (l3|) equals ||(5i2 - ^ 23)^12 + (^34 - 
< 523)^34 + (<541 — (^ 23)^41 lb- The conclusion follows since 


(5 i 2 — ^23)^12 + (^34 — ^23)^34 + (^41 — ^23)^41 > min ||(^12 — ^23)^12 — 5^34 — •5^41 II2 

2 s,s'eR 


Tspan(t34,44i)-'- (^12 ^ 23)^12 


□ 


2.3 C 4 S inequality and rotation propagation 

The following lemma is a generalization of the triangle inequality in a context of the rotational part 
of structure deformations. 


Lemma 3 (C 4 S Inequality). Let d > 4; x,?/ G M'^. Let S = {(ti,pi), • • • , {tk,Pk)} C x If S 
is c-well-distributed with respect to (x, y), then for all vectors hx,hy,ht^, ■ ■ ■ ,htf,, hp.^ , • • • , hp^ G 
and sets X C [k], we have 


\\L’[x-pi)-'-iLx hp^)\\2+\\P(^p^-t^'j±{hp^ Ub||2 +||T(^_j^)x(/ it; hy)\\2 P {ck 

ie[fc]\x 


{hx hy)\\2. 
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Proof. For each i £ [A:], define Wi = span(x — Pi,Pi — ti,ti — y). Define P as the projection map to 
the space of vectors orthogonal to x — y, and define Pi for each i G [A] as the projection map to 
Wj-*-. Since {x — Pi)'^ D IDj-*", {pi — ti)-^ D ^i^; and (ti — y)-*- D W/-, it follows that 

\\^{x-pi)-^ihx ~ ^pjlb + (fipi — ht.)\\2 + ||-Pp^_p)X (/itj — A-p)||2 

ie[A:]\JV 

- X] W^ii^x - hp^)\\2 + \\Pi{hp^ - ht^)\\2 + \\Piiht^ - hy)\\2 > ^ WPiiK - hy)\\2. 

ie[A:]\x ie[fc]\Js: 

Since {(ti,pi), • • • , {tk,Pk)} are well-distributed with respect to {x,y), we have 

\\Pi{hx - hy)\\2 > ck ■ \\P{hx - hy)\\2. (4) 

ie[k] 

Since {x — y)-*- D bFj^, we have \\Pi{hx — hy)\\2 < \\P{hx — hy)\\2 for all i. Hence 

\\P^{hx - hy)\\2 > {ck - |X|) • \\P{hx - hy)\\2, 

ielk]\x 


proving the lemma. □ 

The proof of Theorem [2] will rely on the following two lemmas, which state that rotational 
motions on some parts of the graph bound rotational motions on other parts. The following lemma 
relates the rotational motions on bad edges to the rotational motions on good edges. Recall the 
notation tij = {l + 6 ij)t^^'^ PVijSij where Sij is a unit vector orthogonal to and pij = \\P^io)±tij\\ 2 . 

Lemma 4. FixT,P. If s < ^ and p>^, then 'Zij&E, hij > w hij- 

Proof. Let i G Vi,j G 14. Note that Condition 1 implies \{{k,i) \ k ^ v,i ^ j]i£,k£,kj G E{G)}\ > 
{\nsP — l){\nip^ — 1) > ^n^HsP^ if p > By Condition 6, the number of pairs {k,£) G 14 x 14 
such that at least one of the edges ii, ki, kj are in can be counted by considering the case when 
ii G Ej, (at most (ens)n^ pairs), kj G E^, (at most (en^jug pairs), and ki G Fife (at most engU^ pairs). 
Hence in total, there are at most ‘iensUi such pairs. By Lemma [3l the ci-well-distributedness of 
(j^(o),p(o)) assumption that e < we have 

yy iPii + Vki + Vkj) > f Cl • -^n^nsp^ - Sen^nsj ■ Pij > ^niUsP^ ■ ryj. 
keVt,ieVs ^ ' 

ii^ki^kj^Eg 

Therefore, if we sum the inequality above for all bad edges ij G Fife, then 

E E {riii + me + Vkj) > ^neusp^ ■ mj- 

ij&Ef, keVe/eVs ijeE^ 

il,ke,kjGEg 

For fixed ki £ Eg, the left-hand-side may sum pke many times as the number of C' 4 S in E{G) 
that contain ki and exactly one bad edge. This is the same as the number of C 4 S whose edge 
opposite ki is bad, plus the number of C 4 S whose edge adjacent to i is bad, plus the number of 
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C 4 S whose edge adjacent to k is bad. In each case, there are at most sniUs such C^s. Hence, the 
left-hand-side of above is at most 


E E iVu + Vke + Vkj) < 3en^ns • ^ r]ij. 

ij E-Eb fc E Vs ij EE^ 

ii,ki^kjGEg 


Therefore by combining the two inequalities above, we obtain 


- 

ij&Ef, 


48e 

cip^ 


ij&Eg 


□ 


The following lemma relates the rotational motions over the good graph Eg to rotational motions 
over the complete bipartite graph 

Lemma 5. Fix T,P. If e < ^ and P> then Eij&Eg Vij > fi Vij- 

Proof. Let i G V£,j G I 4 . Note that Condition 1 implies \{{k,i) \ k ^v,i ^ j;ii,k£,kj G L1(G)}| > 
{^Usp — l){^n£p‘^ — 1) > ^niUsP^ if p > Similarly as in Lemma 01 Condition 6 implies that 

the number of pairs {k,£) G x I 4 such that at least one of the edges i£, k£, kj are in Ei, is at most 
SeniUs. By Lemma El the ci-well-distributedness of (T^^\ along G, and the assumption that 

3 

e < we have 

iVii + Vke + Vkj) > f Cl • ^ngusp^ - SeniUsj ■ Vij > ^niUsP^ ■ Vij- 
k&Va&Vs ^ ^ 

ii^ki^kj^Eg 

Therefore, if we sum the inequality above for all z G Vi,j G I 4 , or equivalently over all ij G 
E{Kn^,nf), then 

E E iVit + Vke + Vkj) > ^neusp^ ■ rjij. 

ij&E{Knf,ns) keVl,e&Vs ij^E{K„f,na) 

kj^i/^j 

ii,ki,kj£Eg 


For fixed k£ G Eg, the left-hand-side may sum p^e as many times as the number of paths of 
length 3 in G that contain k£. Each path of length 3 can be thought of as an edge originating from 
Vi, an edge in the middle, and an edge terminating in 14- The total number of paths of length 3 
in G containing k£ equals the number which have k£ as the middle edge, plus the number with k£ 
as the edge originating from I 4 , plus the number with k£ as the edge terminating in I 4 . In each of 
these cases. Condition 1 ensures that there are at most such paths of length 3. Hence, the 

term appears at most 12p^n£ns times. Hence, the left-hand-side of above is at most 

E E iVie + Vki + Vkj) < Up'^neus ■ Vij- 

ij&E(Knf,ns) kS:Vi,e&Vs ij&Eg 

k^i,t^j 

ie,ke,kj€.Eg 


Therefore by combining the two inequalities above, we obtain 


- 

'j.j^E{Kn^.ns ) 


12 • 16 
cip 


ij&Eg 


□ 
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2.4 Proof of Theorem [2] 

We now prove the deterministic recovery theorem. 


Proof of Theorem [M By Lemma[T]and the fact that Conditions 1-6 are invariant under global trans¬ 
lation and nonzero scaling, we can take = 0 and L{T^^\ = 1 without loss of generality. 

The variable /ioo = maxj^j ||tj °^||2 is to be understood accordingly. 

We will directly prove that R(T,P) > P^^^) for all (T,P) ^ such that 

L(r, P) = 1 and i + p = Consider an arbitrary feasible T, P and recall the notation tij = 
(1 -I- + VijSij where Sij is a unit vector orthogonal to and r]ij = \\Ppo)±tij\\ 2 - Since 

Vij = holds for all ij E Eg, a useful lower bound for the objective R{T,P) is given by 


R{T,P) 


X] W^vfMjh — 

ij&E{G) 

> 

> 


i-j&Eg ijeEb 

ij&Eg ij&Eb 


( 0 ), 

ij \ 



ij&Eg ij&Eb 


(5) 


h ^ijeEg Pip by ([5]), we have 


Suppose that EijeEt l^ijMj h < ^ijeEbPip Since Lemma H for e < ^ implies Y.ij&EbPij ^ 


R{T,P) > R(T^^)^pi^))+Y.rJ,g-Y,{\mt^h + P^J) 

ij&Eg ij&Eh 

> R{T^^\P^^^) + Y, Pij - E 2 % > 


ij&Eg ijeEb 


Hence we may assume 

E II 2 > E (6) 

ij&Ef, ij&Eb 

In the case \Eti\ / 0, define 6 = YhijeEb 1*^*11 average ‘relative parallel motion’ on the 

bad edges. For a pair of vertex-disjoint edges ij, M E E{Kni,ns)^ define p{ij, ki) = Pij+Pkj+’nki+Vii, 


Case 0. iPfel = 0 or (5 = 0. 

Note that 5 = 0 implies dij — 0 for all ij E E^, which by ([6|) implies rjij = 0 for all ij E E),. 
Therefore by ([5]), we have 

p(r,p)>p(r(o),p(o))+ ^ 

ij&Eg 

If YlijeEg Pij > 0 ) R(T, P) > R{T^^\ Thus we may assume that T]ij = 0 for all 

ij & Eg. In this case, we will show that T = and P = P^^f 

__ 3 

By LemmaO if e < then rjij = 0 for all ij E E{G) implies that rjij = 0 for all ij E E{Kn^^nJ- 

For ij E Eh, since 6ij = ijij = 0, it follows that ^ij = Since = ^ij — ^ 

we have 

0 = E P - 4“') = E -4’) + E P - 4') = E (««-4’) = E 

ij&E(G) ijeEb ijeEg ij&Eg ij&Eg 
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where the first equality is because L{T,P) = = 1. By Condition 2, ||tj -°^||2 7 ^ 0 for 

all i ^ j. Therefore, if Sij 7 ^ 0 for some ij G Eg , then there exists ab, cd G Eg such that 5ab > 0 
and 5cd < 0. If ah and cd are vertex-disjoint, Lemma [2] and Conditions 2 and 4 force ri{ab, cd) > 0, 
which contradicts the fact that rjij = 0 for all ij G If ab and cd are not vertex-disjoint, 

then, let abc'd' be an arbitrary (74 containing ab and cd. Then Lemma [2] implies the same result as 
above. Therefore 5ij = 0 for all ij G Eg, and hence 5ij = 0 for all ij G E{G). 

Define ti = -|- hi for each i £ Vi. Dehne pj = p^p + hj for j £ Vs. Because r]ij = 6ij = 0 for 

all ij £ E{G), we have hi = hj for all ij £ E{G). Since G is connected by Condition 1, this implies 
hi = hj for alH G V^, j G 14 . Then by the constraint YlieVt + J2j&VsPj ~ hi = 0 for all 

i £ Vi and hj = 0 for all j £ Vs. Therefore T = and P = P^^'>. 

Case 1. \Eij\ / 0 and 5/0 and Ylij&Eg \^ij\ < 

Define = {ij £ E^ : |5i/ > ^5}. Note that Yhij^EVG ^ \^\^b\ and therefore 

E = E l^b-l - E l^bl > E = )[5\Eb\. (7) 

b'sifc ij&Ey ijeEb\Lb ij&Eb 

Dehne Eg = {ij £ Eg : \6ij\ < |5}. Then by the condition of Case 1, 

-S\Eg\ > \Sij\ > \5ij\ > -^5\Eg\Eg\, 

ij&Eg ijeEg\Eg 

and therefore \Eg\ Eg\ < ^\Eg\, or equivalently, |Fg| > 

For each ij £ Lb, dehne Fg{i,j) = {M £ Eg \ k ^ i,(. ^ j{. Note that by Condition 1, 
\Eg{i,j)\ > \\Eg\ — 2p{ni + ns). For any k£ £ Fg{i,j), since \Sij\ > ^6 and \5ki\ < \d, we have 
\dij — dki\ > Thus Lemma[2]and Conditions 3 and 4 give rj{ij,k(.) > /3|5ij — 5ki\ ■ > 

/ • \\dij \ ■ Therefore by Condition 1, 

E E’>(«.«) > E E ^i%i= Ei^i(u)i-^fei 

ij&Eb keeEg ij&EbM&Eg{i,j) ij&Eb 

> ^^^^{^\Eg\-2p{ni + ns)'^\dij\ 

Note that if e < \p, then \Eg\ > — \Eh\ > Further note that n^,ns > 64 implies that 

2p{ni -t- Us) < ^n^nsp. Hence by m, 

E E - ^^32 °° ^ - ^^2 °° 

ij&Eb kl&Eg ijeLb 

For each ij £ E{Kni,ns)i we would like to count how many times each pij appear on the left hand 
side. If ij £ Eij, then there are at most n^ng G^s containing ij; hence r]ij may appear at most 
times. If ij ^ Eb, then rjij appears when there is a C 4 containing ij and some bad edge. If the bad 
edge is incident to ij, then there are at most 2enins such C^s, and if the bad edge is not incident to 
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ij, then there are at most \Eb\ < eniris such ( 748 . Thus r]ij may appear at most A-SeneUs = 12engns 
times. Therefore 


E E ri{ij,k£) < AngUs ■ rjij + ^ 12en£ns ■ rjij . 

ijeE^kieEg ij&E^ ij&E{Kn^,ns) 

_ 3 

By Lemma m if e < we have 

E v^ ... ... 48 • 4e 2046: 

> M) < - ig-ngn^ } , Vij + > , 12en£ns • rnj < - ^ngns 

Cl O'* Cl 

lj&Et,M&Eg ij&Eg ij&E(Kn^,ns) ij&E(Knf,ns) 


Y1 


Hence 


204e 

ClP^ 


-ngris 


^ Vij > 




PcqUc 

64 


'-ngUs ■ S\Eb\. 


If e < 3 g^ 20 ^^ 64 ’ by Condition 3, 5 7 ^ 0, and \Eb\ / 0, the above implies 

. /3coCip3 

/ ^ Vij — nn/f aA^k'OO ' o|.E^6| 


ij^E(Kn^^ns) 


Lemma [5] implies 


204-Gde' 


384 -jr, I 384 ( 0)11 

^ lb- 

ij&E, 


■®ll 

192 I|2- 


ij&Eg 


Therefore by §i),we have Y^ijeEg Vij > Y.ijeE,i\^ij\\\'>^ij^ h + Vij) if e < min{2i|i, H, and 

P > y^- By ([5]), this shows R{T,P) > R{T^^\ This condition on e is satisfied under the 

assumption e < 


Case 2. \Eb\ / 0 and (5 7 ^ 0 and J^ijeEg I'^bl - 

Dehne E^ = {ij & Eg : 6ij > 0} and E- = {ij £ Eg : 5ij < 0}. Since £ij — II 2 for 

ij £ Eg, we have 

0 = E = E««-4“’)+E««ii4“’ii^' 

ij&E{G) ijeEb ij&Eg 


where the first equality follows from L{T,P) = L(r®,pi‘^i). Therefore, 


E ‘^4’ 

70) II 

% 2 

< 

M 

1 

3 

ij&J^g 



ijeEb 


— + Vij) — ‘^PooS\Eb\, 

ijeEt, 


where the last inequality follows from ([ 6 ]), Condition 3, and the definition of <5. On the other hand, 
the condition of Case 2 and Condition 3 implies YlijsEg , |<^iilll 4 i ^||2 > lcoPooS\Eg\. Therefore 

E Mb)lli°^ll2 = ^ (- E ^bl|i®ll2+ E l<^blP®ll2) >UlcoPooS\Eg\-2^^^6\Eb\ 

ij&E. \ ij&Eg ij&Eg ) ^ 
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If e < then since |-Efe| < eniUs and \Eg\ > ggg ^co^oo^\Eg\ — 

2fj.^6\Eb\ > ^coHooS\Eg\. Therefore X]y 6 E_(-<Ijj)||t ®||2 > ^CQHoo'^\Eg\. Similarly, Eije£;+Il4°^ lb > 

'^C()fJ>C)0^\Eg I . 

If |£’+| > ^\Eg\^ then by Lemma [2] and Conditions 3 and 4, we have 


Y1 E > 

ijeE- k£eE+ 

k¥=i,^¥=j 


> 


> 

> 


Note that if e < jp, then \Eg\ > 
2p{ni + rig) < ^niHsp. Hence, 


E E /5(-«.j)ii4“’ii2 

ijeE- k£eE+ 

k¥=i,^¥=j 

Y1 (“'^T)lbl?ll2 -/^d^^+l -2p{ni + ns)) 

ijeE. 

EoPooS\Eg\ ■ l3{\E+\ - 2p{ne + Us)) 

^coPoJ\Eg\[^\Eg\ - 2 p{ni + ns)) . 

I-E’ftl > Further note that ng^ng > 64 implies that 


Y1 E ki) 


ij&E- k£eE+ 

k^i,l^j 


> 


—f3copoo6 


ngUsp 

4 


ngUgp ^ ^cqp CO Srijulp"^ 

16 “ 32 • 64 


Similarly, if |i?-| > ^\Eg\, then we can switch the order of summation and consider Ylij£E+ YlkieE^ 
to obtain the same conclusion. 

Since each edge is contained in at most ngUg copies of C 4 and there are 4 edges in a 6 * 4 , we have 


E E r]{ij,k£) <4ngng rjij. 

ij&E- MeE+ ij&E(Knf,-n,s) 


If e < then since 5 7 ^ 0 and \Eh\ < engUg, we have 


L] - 

ij&E{K„^,n^) 


1 iScoPooS 2 2 2 ^ T ^384 - 

1- QO RA - A QO aA k'ooOngUg > - Pood\Eb\. 

rngUg 32-64 4 - 32-64 cip 


By Lemma El if e < then this implies 


Pij — 292 Eh > ‘2poo5\Ei\. 

^3 '^j ^^{^7i£,7is ) 


Therefore from (El); dS]), and Condition 3, if e < min{^, |, and p > then 

R{T,P) > i2(rW,pW)+ ^ r?,,-- ^(|J,,-|||t®||2 + r?,,-) 


Il2 

ij&Eg ijeEb 

> RiT^^\P^^^) + 2p^6\Eb\ - Y 2|<5*jlll4f II 2 > R{T^^\P^^^). 

ij&Eb 


This condition on e is satisfied under the assumption e < 


PcqcIp* 

384-204-64 ' 


□ 
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2.5 Properties of Gaussians 

In this section, we prove that i.i.d. Gaussians satisfy properties needed to establish Conditions 
3-5 in Theorem [2j We begin by recording some useful facts regarding concentration of random 
Gaussian vectors: 

Lemma 6. Let x,y be i.i.d. M{0, Idxd), cind e < 1, then 

P (d(l - e) < \\x\\l < d{l + e)) > 1 - 

and 

P(|(x,y)| > de) < 

where c> is an absolute constant. 

Proof. Both statements follow from Corollary 5.17 in m, concerning concentration of sub-exponential 
random variables. □ 

Lemma 7. Corollary 5.35 in m- Let A be QjTi tl d Tndtvtx wtth ltd .A/"(0j efitvies. Theft fov 
any t >0, 

p (cTmaxi^) > y/u + Vd + t^ < 2e“V 
where (Tniax(^) is the largest singular value of A. 

Lemma 8. Let ti,tj,tk,ti ^ Ldxd) be independent. There is a universal constant c such that 
with probability at least 1 — 15e“'^'^, 

\\-^spa,n(tii—tj ^i)ll ^ 1 

\\ti-tj\\ ~ 4' 

Proof. Let c be the constant from Lemma 0 Let x = ti — t£,y = tk — tj, z = ti — tj. Observe 

= ^ - (5, x)x - {z, y^±)ya;± = z-{z, x)x - {z, y)y + ((z, y)y - {z, y^i-)y^i.) 

= z- {z, x)x - {z, y)y + {yy^ - y^±yl.±)z, 

where y^± = ; which is well defined with probability 1. By the triangle inequality, 

II '^span(3;,y)-L ^11 — I (-^j x) \ |(. 2 ,y)| \\yy llop 

For arbitrary unit vectors d,b £ ||aa* — 66*||op = | sin0|, where 9 is the angle between d and 

b. This fact can be verified by direct computation after taking a = ei and b = cos 9 ei + sin 9 62 
without loss of generality. Hence, ||yy* — ||op = | sin 9\ = | cos a|, where 9 is the angle between 

y and y^±, and a is the angle between y and x. Thus \\yy^ — yxzyl.±\\op = \{y,x)\. So, 

ll^span(x,y)-L^II > ^ 1 - \{z,x)\‘^ - \{z,y)\ - \{y,x)\ 

Now, note that 

..'12 {ti-tj,ti-tey _ {\\tif - {ti,te) - {tj,ti) + {tj,ti)f 

\\ii - \\ii - 
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By Lemma [6] with e = 0.01 




(d(l + e) + ^deY 
Ad^{l — eY 


< 0.3 


with probability at least 1 — 6e ^ for some universal constant c. Similarly, \{z,y)Y < 0.3 with the 
same probability. Since y and x are independent, by Lemma[6]with e = 0.01, |(y,x)| < < 2e 

with probability at least 1 — 3e“'^‘^. Thus, we observe 

ll^span(.,,)^^ll > Vos - 0.02 > i 

with probability at least 1 — 15e“'^'^. □ 

Lemma 9. Let ti,tj,tk,ti ~ M{0,ldxd) be independent for d>3. There is a universal constant c 
such that with probability at least 1 — 7e~^^, 

ll'^span(ti—^i)ll ^ 1 

Wu-tjh “ 4‘ 

Proof. Let c be the constant from Lemma [H Let u = ^ = tj,x = tk- Each of these 

variables are i.i.d. AA(0,1). Note that 


P. 


spa.n{ti-te,tk-te)- 


(ti tj ) — Pg 


span(w,3i— -^) 


^L^-L W — 


V2J- 


Without loss of generality, rotate coordinates so that u is in the direction of ei. Thus, it suffices to 

.J-) 

V2> 

follow the distribution W(0, ^Id-ixd-i)- Note that 


bound ||P^-_— :^)||2 where v,w,x ^ M{0, Id-ixd-i)- Note that x — and 'bb — both 



2 

V 


— 

w - -= 

V2 




\x — 


J^\\2 

V 2 II 


W — 


V2 


2 {{w,x) - ^{w,v) --^{v,x)+ ^-y 


\x — 


V ||2 

7211 


Hence Lemma [6] with e = 0.01 shows that with probability at least 1 — 6e the above is at 
least 


-(d- 1)(1 -e) - 


(l(d-l)(l + e)-3e(d-l)) 

|(d- 1)(1 -e) 


> (d-1) > -d, 


Thus, we have that 


IP 


spa.n{tj-te,tk-te)- 


- (ti - t 


JJW — 


> -d 


with probability at least 1 — 6e To conclude the proof, note that Lemma[6]with e = 0.01 implies 
that ||ti — tj\Y > 2d(l + s) with probability at least 1 — e~^^. □ 

We can now establish Conditions 3-4 of Theorem [2] with high probability. 
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Lemma 10. Let ti,pj ~ J\f{0,ldxd) for i ^ Vi, j ^ Vs be independent. Condition 3 of Theorem\E 
holds with Co = ^ with probability at least 1 — 2ninse~‘^'^ for a universal constant c. Condition 4 
of Theorem\^ holds with (3 = j with probability at least 1 — 22njn‘^e~‘^'^. 

Proof. Condition 3 follows from applying Lemma [6] with e = 0.01 and a union bound to ||ti — Pj ||2 
for all UiUs pairs (i,j) G Vi x 14. Condition 4 follows from applying Lemmas [8] and [9] and a union 
bound over the at most choices of i, k £ Vi and j, i G Vs. □ 

2.6 Gaussians are well-distributed 

In this section we prove that Condition 5 of Theorem [2] holds with high probability. 

Lemma 11. There exist constants do and Kq such that the following holds. Let G = (14 U Vs,E) 
be a bipartite-p-typical graph. Let ti,pj ~ M{0,ldxd) for i £ Vi, j £ Vs be independent from G and 
each other. Let T = and P = {pj}j&Vs- If d> d^ and ni,ns > max(il'o, lOOd), then {T,P) 

is ^-well-distributed along G with probability at least 1 —for universal constants c, Kq. 

We start by proving an intermediate lemma asserting the well-distributedness of pairs of random 
Gaussian vectors {(b,Pi)}ie[A:] with respect to a fixed pair of random Gaussian vectors {x,y). 

Lemma 12. There exist positive constants do,KQ such that the following holds. Let x,y,ti,pi ~ 
AA(0,/dxd) be independent, where i £ [k]. Then the set -well-distributed with 

respect to {x,y) with probability 1 — Qke~'^^ if k > max(.^ 0 ) fOd,) and d > do- 

The proof of this lemma appears at the end of this section. We will deduce Lemma [H] from 
Lemma [12] by partitioning the edge set of G into sets of vertex-disjoint edges. A matching is a set 
of vertex-disjoint edges. A perfect matching of a graph is a matching that intersects all vertices. 
The following is a well-known lemma in Graph theory. 

Lemma 13. Let G = (y,E) be a bipartite graph with vertex partition 14 = 14 U 14, and let A be 
the maximum degree of G. There exists an edge-partition E = EiD ■ ■ - L) E/\ such that Ea forms a 
matching for each a £ [A]. 

Proof. By adding vertices and edges to G if necessary, we can obtain a A-regular bipartite multi¬ 
graph G'. By Hall’s theorem, every non-empty regular multi-graph contains a perfect matching 
(see [21 Corollary 2.1.3]). Let Fi be an arbitrary perfect matching of G'. Remove Ei from the edge 
set of G', and note that the remaining graph is still regular. Thus we can repeat the process to 
obtain a partition E{G') = Ti U • • • U Ta of the edge set of G' into perfect matchings. The sets 
Ea = EaCi E{G) for a £ [A] satisfy the claimed condition. □ 

The proof of Lemma [TT] follows from the two lemmas above. 

Proof of Lemma [771 Recall the notation that N = max{|14l, 1141} and n = min{|14|, |14|}- Since G 
is a bipartite-p-typical graph, the maximum degree A of G is at most 2Np. By Lemma [131 there 
exists an edge-partition E = EiU ■■ - id E^ such that each Ea for a = 1, 2,... , A forms a matching. 

Fix a pair of indices (4, Jo) for i^ £ Ve and jo £ 14- Let Fi' C Fi be the subset of edges that do 
not intersect io or jo, and for each a £ [A], let E'^ C Ea be the subset of edges that do not intersect 
io or jo. Let A C [A] be the set of indices a for which \E'^\ > max(Firo, lOd). For each a G A, by 
Lemma [T2l we see that with probability at least 1 — 0{\E'^\e~^'^), 

\\I^spa.n{pjQ-ti,ti-pj,pj-tiQ}r{h)\\2 > 

ij&K 
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holds for all h G W^. Therefore by the union bound, with probability at least 1—0(X]aeA — 

1 — 0{\E'\e~^'^) > 1 — 0{nNe~^^), the above holds simultaneously for all a G [A], Conditioned on 
this event, for all /i G M'^, 


Eii^. 


span{pjp -pj ,pj-Ug}-'- 


ijeE' 


> 


E E 11^.. 


spa,n{pj^-ti,ti-pj,pj-ti^^}-^[’T')\\2 > 

a&Aij&E'^ a&A 




Since \E'\ = l-®al) we see that 


\\^spa.n{pj^-ti,ti-pj,pj-ti^}^{h)\\2^ f l-f' I l-^al j ll^lb- (8) 

ij&E' \ a^A / 

Since G is bipartite-p-typical, we have |ili'| > ^nNp — 2{N + n)p > \nNp if n > 16, and by the 
definition of A, we have \^'a\ — ™ax(^ 0 ) lOd) - A < ^nNp if n > 16-max(Xo, lOd). Hence the 

right-hand-side of ([5D is at least ||/i ||2 for all h G This shows that the set {(ti,pj)}i^i^j:^j^ 

is ^-well-distributed with respect to {ti^^,pj^) with probability at least 1 — 0{nNe~'^^). By taking 
the union bound over all choices of pairs {ip, jo) G x I 4 , we can conclude that {T,P) is ^-well- 
distributed along G with probability at least 1 — □ 

We now prove Lemma [T2j 

Proof of Lemma\l‘A Throughout the proof, the positive constant c may change from line to line, 
but is always bounded below by the positive constant of the lemma statement. 

For each i, let Wi = span(tj - y,Pi - x, U - pi) = span(x - y,pi + U - {x + y),ti - pi). Thus 
Pyy± oPf^^_y~^i_ = Ppj/x. Therefore, it is enough to show that for all /i T x — y, with high probability 

n 

Y.\XwAh)h>^n\\h\k. 

Z=1 

Letting Vi = span(x — y,pi + ti — x — y), we have 

Wi = span(x - y,pi + U - x - y,ti - pi) = span(^x - y,pi + ti- x - y,Pyr{ti -pi)'^. 

Now, for any h E {x — y), 


i=l 2 


Y1 


2 > 


2 = 1 


2=1 

Y,(pvph)-Pp 


> 


> 


YPAh) 

2=1 

n 

'pPvME 




2=1 


2=1 


-E 

2 1=1 


Pp ~{ti-pi){h) P(^ti-pi){P) 
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Since \\Py{h) — Pwih )\\2 < \\vP — HopH^lb < ||^ — iwlbll^lb holds for all vectors v,w,h ^ M“, 
the above is at least 




2 = 1 


E 

2 = 1 


Pi) (ti-pi) 


\\Pv.^iti-Pi)h Wu-Pih 




2 = 1 


(9) 


Note that when v = P{w) for some orthogonal projection operator P, we have 


\v — w \\2 = 2{1 — {v,w)) = 21 1 — 


{P{w),w) 

||P(tc)||2||u;| 


= 2 1 - 


ll^lb 


Thus, 


Pv^^(^i Pi) [ti-pi) 


WPy.^i'^i - Pi)h WU-Pih 


= V2\ l- 


WPy.^iti - Pi)h 


-Pih 


Hence ([9]) is at least 

n 

En;Uft) 

L 

n 

^PvUh) 

2 = 1 


E 


2=1 


Py^ 

i 

i - Pi) 

(ti - Pi) 

1 

\\Pyy (L 

z 

-Pi)h 

WU-Pih 

2 miniditi-pilli) 


- Pi)iti - PiT 


2 = 1 


op 




WPy.^i^i - Pi)h 


i=l 


-pih - piWl) 


- Pi){ti - PiY 
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We will now expand the first term, || Py.^ (^) lb- Let u = x — y and Zi = x + y — {pi + ti). We 
have 


PVi^W = Ps{u,zip{h) 

= Ps{u,p^^{zi))^{h) 

= h- Pu{h) - Pp^^(2,)(/i) 

= h-Pp^^(,^)(/i) + P,,(/.)-P,,(/i) 
= P,x(h)-Pp^^(,,)(/i)+P,,(/i), 

where we used /i T u in the fourth inequality. Thus, 
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Letting X, = , Y, = ^ 

have shown that for any h J- x — y, 

n n 


y)^{^+y U PQIb _ Iiy^n /, _ 

\x+v-u-vi\\i ’ “ ll2_/i=ilL Pi) 


l|a;+?;-ii-pi||2 

ih) 


lop 


, we 


i=l 


i=l 


'^V2\^yi-Xi + ^yi-Yi 


2 = 1 


mini(||ti -pi\\i 




( 10 ) 


We will separately bound the first term and last two terms with high probability. 

We now show that the first term of (jlOp is bounded below by 0.3n||/i||2 with high probability. 
Because L + Pi ='^ \/2L, it suffices to show that with high probability 


P{x+y-V2ti)^(.^) 


2=1 


> 0.3n||/i||^ 


Let V = X + y and Wi = —y/2ti. Note that 
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where in the last inequality we used 
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Now, let A = X)r=i ^ We have 
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Thus, 
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n\\v\\l + 2 ||u ||2 lEILi 

miiij ||u + Wi \\2 


Now, consider the event 
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min u + tCj 2 ^ 4(i/3i, u 2 < 2(i/32, 

1 * 
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< 2ndP3, crmax(A)^ < 2n/?4 


On El we have, 
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4d/3i 


= IK'-II2 
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= IK'-||2 
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Now, let /3i = 1 - /32 = /33 = 1 + 3 ^, /34 = ^dPi. This gives 


1 /32 04 


= 1/10 A 

A v^' 


Assuming n > 550, we see that on Ei, 

n 

^ \\P(:^+y_ti-pi)^{h) > 0 . 3 n||/i|| 2 . 


2=1 


Now, we bound P(£'i). Note that |||u + ='^ ^ll^lli ^ IIZ]r=i '*^*112 X^{d) and is 

a random n x d matrix with i.i.d. AA(0,1) entries. Thus, by applying Lemma [ 6 l we have 
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^4d(l — e) < \\v + Wi \\2 < 4d(l + e)^ > 1 — e 
P (2d(l - e) < ||u||i < 2d{l + e)) > 1 - 

n 

E* 




P 2nd(l - e) < 


i=l 


< 2nd{l + e) > 1 — e 


where c > 0 is a universal constant. Also by taking t = 2y/d in Lemma [7] we get 
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whenever ^fn + 3\/d < \/n^, or equivalently (a/ fyy/d — ^)^/n > Sy/d, which holds when n > 550 


and d > 10. Thus for n > 550, we have 


—cd 


IP(T'i) > 1 — 2 ne 

We now show that the second term of (| 10 p is bounded above by 0 . 2 n||/i ||2 with high probability. 
Dehne the event 


E2 = {Xi> I- 
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> 1 - 
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1 


800’ mmi{\\ti-pi 


On E 2 , we have 




2=1 


+ 


minj(||ti -piWi 


T^Zi < O.ln, i = 1, 2 ... n 


-Z,||h||2 <0.2n||h||2. 


We now estimate P(£' 2 ). For Xi, since {U — pi) is independent from {x — y,pi + ti — x — y), we 
can view the latter as fixed. That is, by conditioning on Vi, and applying a rotation R such that 
R{Vi) = span(ei,e 2 ), we have 


-Pi)h 




Elztuur 


where ti{j) is the jth entry of t*. As Yl'j=i ~ Xd -2 ~ Xd’ Lemma [ 6 ] can 

be repeatedly applied to give P(Aj > 1 — for all i) > 1 — 2ne~^'^. A similar argument gives 
> 1 — ^ for all i) >1 — 2ne~'^^ because x — y and x + y — {ti + pi) are independent. 

We now bound the probability of the third condition in the dehnition of E 2 . Note that 


minj(||tj -pj|| 2 ) 


'^{ti-Pi){ti - Pi) 


2=1 


op 


minidltilli) 




2 = 1 


op 


Let B = Y{Ji=i 6*^1 ■ By Lemma [3 || Ya=i Li* Hop = o-max(-B)^ > n( 1 + 3w - ) with probability at 


least 1 — 2e . By Lemma[ 6 l ||ti|l 2 — i with probability at least 1 — ne We 

conclude 
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miniditjlli) 
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n 1 + 3 


op 


d{l — e) 


with probability at least 1 — 2ne If e = 0.01 , d ^ 40, Ti ^ lOd, we have 
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Zi < O.ln ) > 1 — 2ne 


—cd 


^minjdltj -Pi\\l) 

Hence, if d > 40, n > lOd, 

P(-F2) > 1 - 6ne"'='^. 

In conclusion, there exist positive integers do and no such that for all d > do, n > no, n > lOd, 
and all h T X — y. 
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T.PwA’') 


2=1 


> 3 ^ 11 ^ 112 ) > l-P[(^inF; 2 r] > l- 6 ne 


—cd 


for some c > 0 , which implies the statement of the lemma. 
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2.7 Random graphs are p-typical with high probability 


We prove that Condition 1 of Theorem [5] holds with high probability. 


Lemma 14. There exists an absolute constant c > 0 such that for all positive real numbers p <1 
satisfying n 2 P > 21og(eni) and nip > 21 og(en 2 ), G(ni,n 2 ;p) is p-typical with probability at least 
1 - nin22”i+”2e-P’"i”2/4 - n?n2e-^(’"2p2) - . 

Proof. Let Vi and V 2 be vertex sets of sizes |Vi| = ni and IV 2 I = re 2 . Throughout the proof, we 
let Vi U V 2 be the bipartition of the random graph G(ni,n 2 ;p). The bipartite graph G{ni,n 2 ',p) 
is not connected only if there exist partitions Vi = Vi,i U Vi ^2 and V 2 = V 2 ,i U 1 / 2,2 such that 
the sets Vi^i U ^ 2,1 and Vi ^2 U V 2,2 are both non-empty and have no edges between them. Let 
|W,i| = ki, 11^2,11 = k 2 , 1 14 , 2 ! = ni — ki and 1 1 / 2 , 2 ! = n 2 — k 2 . For fixed ki,k 2 , by the union bound, 
the probability that there exists a partition as above is at most 




( 11 ) 


If /ci < ^ and k 2 < then by Stirling’s formula, (fTT]) is at most 

If A:i > ^ and k 2 > ^, then let = ni — ki and £2 = ^2 — ^ 2 - Then (flTT) is at most 
(^^y (1 -p)(^l-2+/2ni)/2 < ^^^-n2p/2y (^^^-mp/2y 


If (^1 < ^ and k 2 > or {ki > ^ and k 2 < ^), then, by (^) < 2” for all 0 < /c < n, (fTTIl is at 
most 

2^1-1-712^]^ _p'^nin2/4: ^ 

Hence the probability that G{ni,n 2 ;p) is disconnected is at most 






ki 



k2 



+ nin22’^l+'‘2g-pnin2/4^ 


6712 . 







l^^g-nip/2 


where the indeterminate factors in the sums corresponding to ki = 0, A :2 = 0, G = 0, or .^2 = 0 
are taken to be unity. Since n 2 P > 21og(e77i) and nip > 21og(e772), the four sums above are 
maximized at {ki,k 2 ) = (1,0), (0,1), {ii,i 2 ) = (1;0),(0,1), respectively. Therefore the probability 
that G{ni,n 2 ',p) is disconnected is at most 

2771772 • 6771 • ^ 2771772 • 6772 • + 77l7722’^l+"2g-pnin2/4^ 


For a fixed vertex u G Fi, the expected value of deg(7;) is n 2 P, and for a pair of vertices v,w G Vi, 
the expected value of the codegree of v and w is n 2 p‘^. Therefore by Chernoff’s inequality (see Fact 
4 from [T]) and a union bound, the probability that all vertices in Fi have degree between ^772P 
and 2 n 2 P, and all pairs of vertices in Fi have codegree between ^772P^ and 2 n 2 P^ is 1 — 

Similarly, the probability that all vertices in F 2 have degree between ^77ip and 2nip, and all pairs 
of vertices in F 2 have codegree between ^77ip^ and 277ip^ is 1 — The conclusion follows 

by taking a union bound over all events. □ 
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2.8 Proof of Theorem [T] 

We can now prove the high dimensional recovery theorem, which we state here again for convenience: 

Theorem 1. Let N = max(ni,ns),n = min(nr,ns). Let G{Vi U Vs,E) be drawn from a bipartite- 
Erdos-Renyi graph with p > 0. Take t^^\ ... tn^ ,pf‘\ ■ ■ ■ ,Pns ~ A^(0, Idxd) to be independent from 
each other and G. Then, there exist absolute constants c, C 3 , G > 0 such that for 7 = c^p"^, if 

max f Gd, AT)^ < „ < AT < 

\C3P^ p J 

Of 1 ~l /2 l/ 2 \ 1 j 

and d = ^( 1 ), then there exists an event with probability at least 1 — 0 {e~ ^ ) _)_g- 2 “)^ on 

which the following holds: 

For all subgraphs Ef, satisfying maxjgj^^j degft(tj) < 7 ns and maxjgj^^] degb(pj) < 'yni and all pair¬ 
wise direction corruptions Vij € for ij G E^,, the convex program (I5|) has a unique minimizer 

equal to {a{tf ^ - C}ie[n,], a{pf ^ - C}je[ns]} for some positive a and for ( = (Eie[n,] tf^ + EjeM pf 

Proof. Let c be minimum of the constants from Lemmas [TU] and [TTJ Let Kq be the constant from 
LemmadlJ It is enough to verify that G, T and Ei, in the assumption of the present theorem satisfy 
the deterministic conditions 1-6 in Theorem [2l with appropriate constants p,/3, cq, e, ci, and with 
the purported probability. By Lemma dH Condition 1 holds with probability at least 

1 - = 1 - 0 {N^e-^^^P^^) 

if np > 2log{eN). Condition 2 holds with probability 1. By Lemma [TOl Condition 3 holds for 
Co = ^ with probability at least 1 — 2nrnse“‘^'^, and Condition 4 holds for /3 = | with probability at 
least 1 —22n|nge“‘^'^. By Lemma fTTl Condition 5 holds for ci = ^ with probability 1 —0(n|nse“'^'^) 
if n > max(iLo) 160d), and d > do. Thus, Conditions 1-5 hold together with probability at least 

1 - 

Take 7 = csp'^ < Because 7 < Theorem [2] implies that recovery via ShapeFit 

is guaranteed. Note that the conditions rnaxigy^ degb(i) < 7 ns and maxjgv"s degft(j) < 'yuf, are 
nontrivial when p > c^ Using this inequality, we have N^e~^^^P^^ < < 

g“^( 2'^3 ^ if n = n(c 3 log^ ES') and N'^e~^'^ < if N" < gs'^'^. Thus, the probability of exact 

recovery via ShapeFit, uniformly in Eb and Vij satisfying the assumptions of the theorem, is at 

1 _ C)(g-f^(^3+ g-l'^'^*). □ 


3 Numerical simulations 


In this section, we use numerical simulation to verify that ShapeFit recovers Gaussian camera 
locations and Gaussian structure locations in in the presence of corrupted pairwise direction 
measurements. Further, we empirically demonstrate that ShapeFit is robust to noise in the uncor¬ 
rupted measurements. 

Let if^'^ G be independent Af( 0 , 73 x 3 ) random variables for i = 1,... ,ni. Let p® G be 

independent Af(0, 73 x 3 ) random variables for j = 1,..., Ug. Let 


ho) ^ ho) 
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Ui n, 




r(0) 


+ 
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A ( 0 ) ~( 0 ) 

and p) = p) — 
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ne n, 




r(0) 


+E#r 
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Let the graph of observations G be a bipartite Erdos-Renyi graph G{ni,ns,p) on rii + ng vertices, 
for p = 1/2. For ij E E{G), let 




Zij 



with probability q, 
with probability 1 — q, 


where Zij are independent and uniform over Let Vij = Uij/||uij||2- That is, each observation is 
corrupted with probability q, and each corruption is in a random direction. In the noiseless case, 
with cT = 0, each observation is exact with probability 1 — q. 

We solved ShapeFit using the SDPT3 solver P [9] and YALMIP [5]. For output S = {T,P) = 

{{ti}ielni],{Pj}jeln,]), define its relative error with respect to 5^°) = 
as 


s 

Sio) 


||5(»)||f 


where ||5'||_p is the Frobenius norm of the matrix whose column are given by {U} and {pj}- This 
error metric amounts to an (.2 norm after rescaling. 

Figure [T] shows the mean relative error of the output of ShapeFit over 10 independent trials for 
locations in generated by p = 1/2, = Ug, cr E [0, 0.05], and a range of values 10 < + ng < 70 
and 0 < g < 0.5. White blocks represent zero average relative error, and black blocks represent 
an average relative error of 1 or higher. Average residuals between 0 and 1 are represented by the 
appropriate shade of gray. The figure shows that ShapeFit can empirically recover 3d locations 
in the presence of a surprisingly large probability of corruption, provided n is big enough. For 
example, if n > 50, ShapeFit outputs a structure with small relative error even when around 
15% of all measurements are randomly corrupted. Further, successful recovery occurs both in the 
noiseless case, and in the noisy case with a = 0.05. 

Figure [2] shows the median residual over 10 independent trials for locations in generated by 
p = 1/2, ni = Us = 25, q = 0.1 and a range of values of 10“® < cr < 10^. We see that ShapeFit 
is empirically stable to noise, with median residuals that are approximately linear in the noise 
parameter a. 
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Corruption probability (q) 


Corruption probability {q) 


Figure 1: Mean recovery error of ShapeFit as a function of the number of locations n£ + Ug and 
the corruption probability q. The data model has 3d Gaussian locations whose pairwise directions 
are observed in accordance with a bipartite Erdos-Renyi graph G{ne,ns,l/2) and are corrupted 
with probability q. White blocks represent an average relative error of zero over 10 independently 
generated problems. Black blocks represent an average relative error of 100%. The left panel 
corresponds to the noiseless case <7 = 0, and the right panel corresponds to the noisy case a = 0.05. 


ShapeFit for = rig = 25, g = 0.1 



Noise parameter a 

Figure 2: Median recovery error of ShapeFit versus the noise parameter a. These simulations are 
based on 50 Gaussian locations in whose pairwise directions are observed in accordance with a 
bipartite Erdos-Renyi graph G(25, 25,1/2) and are corrupted with probability q = 0.1. The median 
is based on 10 independently generated problems. 
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